Overview

Dataset statistics

Number of variables: 27
Number of observations: 61001
Missing cells: 160335
Missing cells (%): 9.7%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 12.6 MiB
Average record size in memory: 216.0 B
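
The missing-cell percentage can be cross-checked from the other figures in this block. The sketch below assumes the percentage is simply missing cells divided by (variables × observations); the report itself does not state the formula.

```python
# Figures copied from the "Dataset statistics" block above.
n_variables = 27
n_observations = 61001
missing_cells = 160335

# Assumption: every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
```

Under that assumption the result rounds to the 9.7% shown in the report.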

Variable types

CAT: 17
NUM: 7
DATE: 2
UNSUPPORTED: 1

Reproduction

Analysis started: 2020-05-12 05:26:58.650237
Analysis finished: 2020-05-12 05:27:11.457941
Duration: 12.81 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

country has constant value "USA" Constant
admin_fee has constant value "20.0" Constant
state_fee has constant value "10.0" Constant
inspector_name has a high cardinality: 116 distinct values High cardinality
violator_name has a high cardinality: 38515 distinct values High cardinality
violation_street_name has a high cardinality: 1477 distinct values High cardinality
violation_zip_code has a high cardinality: 58 distinct values High cardinality
mailing_address_str_number has a high cardinality: 9703 distinct values High cardinality
mailing_address_str_name has a high cardinality: 16851 distinct values High cardinality
city has a high cardinality: 3266 distinct values High cardinality
state has a high cardinality: 58 distinct values High cardinality
zip_code has a high cardinality: 2900 distinct values High cardinality
violation_code has a high cardinality: 151 distinct values High cardinality
violation_description has a high cardinality: 163 distinct values High cardinality
late_fee is highly correlated with fine_amount High correlation
fine_amount is highly correlated with late_fee High correlation
violation_zip_code has 36977 (60.6%) missing values Missing
mailing_address_str_number has 1014 (1.7%) missing values Missing
non_us_str_code has 61001 (100.0%) missing values Missing
hearing_date has 2197 (3.6%) missing values Missing
grafitti_status has 58780 (96.4%) missing values Missing
violation_street_number is highly skewed (γ1 = 141.0815836) Skewed
discount_amount is highly skewed (γ1 = 26.8565106) Skewed
clean_up_cost is highly skewed (γ1 = 26.06415029) Skewed
ticket_id has unique values Unique
non_us_str_code is an unsupported type, check if it needs cleaning or further analysis Unsupported
fine_amount has 782 (1.3%) zeros Zeros
late_fee has 8054 (13.2%) zeros Zeros
discount_amount has 60239 (98.8%) zeros Zeros
clean_up_cost has 59421 (97.4%) zeros Zeros
judgment_amount has 790 (1.3%) zeros Zeros
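
The Skewed warnings above report γ1 for each flagged column. A minimal sketch of a moment-based skewness in plain Python follows. It assumes γ1 denotes the third standardized moment; the report does not say whether the population form or a bias-adjusted sample form is used, so both are shown. The `toy` list is hypothetical data, not taken from the dataset.

```python
import math


def skewness(values):
    """Population skewness g1 = m3 / m2**1.5, using central moments m2 and m3."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


def adjusted_skewness(values):
    """Bias-adjusted sample skewness G1 = g1 * sqrt(n * (n - 1)) / (n - 2)."""
    n = len(values)
    return skewness(values) * math.sqrt(n * (n - 1)) / (n - 2)


# Hypothetical column: mostly zeros with one large value, loosely like
# discount_amount or clean_up_cost, which are dominated by zeros.
toy = [0, 0, 0, 0, 10]
print(skewness(toy), adjusted_skewness(toy))
```

A long right tail over a mass of zeros pushes both estimates well above zero, which is the pattern the warnings flag.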

Variables

ticket_id
Real number (ℝ≥0)

UNIQUE

Distinct count: 61001
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 331724.5328109375
Minimum: 284932
Maximum: 376698
Zeros: 0
Zeros (%): 0.0%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 284932
5-th percentile: 291042
Q1: 310111
median: 332251
Q3: 353031
95-th percentile: 371154
Maximum: 376698
Range: 91766
Interquartile range (IQR): 42920

Descriptive statistics

Standard deviation: 25434.93214
Coefficient of variation (CV): 0.07667486009
Kurtosis: -1.139089268
Mean: 331724.5328
Median Absolute Deviation (MAD): 21527
Skewness: -0.0383293795
Sum: 2.023552823e+10
Variance: 646935773
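
Several of the ticket_id statistics above are derived from others. The sketch below recomputes them from the reported values, assuming the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared).

```python
# Values copied from the ticket_id statistics above.
minimum, maximum = 284932, 376698
q1, q3 = 310111, 353031
mean = 331724.5328
std = 25434.93214

value_range = maximum - minimum  # reported Range: 91766
iqr = q3 - q1                    # reported IQR: 42920
cv = std / mean                  # reported CV: 0.07667486009
variance = std ** 2              # reported Variance: 646935773 (rounded)

print(value_range, iqr, cv, variance)
```

The same relations apply to the other numeric variables in this report.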
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
286708 | 1 | < 0.1%
317646 | 1 | < 0.1%
364775 | 1 | < 0.1%
366822 | 1 | < 0.1%
360677 | 1 | < 0.1%
362724 | 1 | < 0.1%
372963 | 1 | < 0.1%
375010 | 1 | < 0.1%
370912 | 1 | < 0.1%
291035 | 1 | < 0.1%
Other values (60991) | 60991 | > 99.9%

Value | Count | Frequency (%)
284932 | 1 | < 0.1%
284943 | 1 | < 0.1%
284944 | 1 | < 0.1%
284945 | 1 | < 0.1%
284946 | 1 | < 0.1%

Value | Count | Frequency (%)
376698 | 1 | < 0.1%
376638 | 1 | < 0.1%
376624 | 1 | < 0.1%
376623 | 1 | < 0.1%
376622 | 1 | < 0.1%

agency_name
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
Department of Public Works | 40731 | 66.8%
Buildings, Safety Engineering & Env Department | 16832 | 27.6%
Detroit Police Department | 3438 | 5.6%

Length

Max length: 46
Median length: 26
Mean length: 31.46223832
Min length: 25

inspector_name
Categorical

HIGH CARDINALITY

Distinct count: 116
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
Zizi, Josue | 6293 | 10.3%
Lusk, Gertrina | 2744 | 4.5%
Snyder, Derrell | 2638 | 4.3%
Forte, Laurie | 2190 | 3.6%
Tidwell, Rhonda | 2165 | 3.5%
McCants, Angela | 2050 | 3.4%
Carver, Gharian | 1978 | 3.2%
Buchanan, Daryl | 1876 | 3.1%
Addison, Michael | 1769 | 2.9%
Frazier, Willie | 1573 | 2.6%
Other values (106) | 35725 | 58.6%

Length

Max length: 22
Median length: 15
Mean length: 14.11237521
Min length: 10

violator_name
Categorical

HIGH CARDINALITY

Distinct count: 38515
Unique (%): 63.2%
Missing: 28
Missing (%): < 0.1%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
HOMES LDHA LP, MLK | 91 | 0.1%
WEEKS, DANA | 82 | 0.1%
PROPERTIES, LLC, KAY BEE KAY | 60 | 0.1%
MAE, FANNIE | 55 | 0.1%
FELLOWSHIP ESTATES LLC, - | 54 | 0.1%
DET 123 FUND LLC | 48 | 0.1%
ARTESIAN EQUITIES LLC, - | 42 | 0.1%
& HERBERT STRATHER, FELLOWSHIP ESTATES LLC C/O WENDELL ANTHONY | 39 | 0.1%
ARTESIAN EQUITIES LLC | 38 | 0.1%
SUMMIT ACQUISITIONS LLC | 35 | 0.1%
Other values (38505) | 60429 | 99.1%

Length

Max length: 109
Median length: 18
Mean length: 20.09504762
Min length: 3

violation_street_number
Real number (ℝ)

SKEWED

Distinct count: 13999
Unique (%): 22.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12566.383829773282
Minimum: -15126.0
Maximum: 20106114.0
Zeros: 2
Zeros (%): < 0.1%
Memory size: 476.6 KiB

Quantile statistics

Minimum: -15126
5-th percentile: 1149
Q1: 6008
median: 12134
Q3: 17165
95-th percentile: 20050
Maximum: 20106114
Range: 20121240
Interquartile range (IQR): 11157

Descriptive statistics

Standard deviation: 141437.2564
Coefficient of variation (CV): 11.25520741
Kurtosis: 20033.50134
Mean: 12566.38383
Median Absolute Deviation (MAD): 5531
Skewness: 141.0815836
Sum: 766561980
Variance: 2.000449751e+10
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
16700 | 55 | 0.1%
18600 | 47 | 0.1%
1600 | 44 | 0.1%
2401 | 39 | 0.1%
5900 | 39 | 0.1%
12000 | 39 | 0.1%
18500 | 38 | 0.1%
20400 | 38 | 0.1%
7601 | 37 | 0.1%
15700 | 36 | 0.1%
Other values (13989) | 60589 | 99.3%
ValueCountFrequency (%) 
-151261< 0.1%
 
-118711< 0.1%
 
-110641< 0.1%
 
02< 0.1%
 
111< 0.1%
 
Value | Count | Frequency (%)
20106114 | 3 | < 0.1%
2010614 | 1 | < 0.1%
1219185 | 1 | < 0.1%
890109 | 1 | < 0.1%
200000 | 1 | < 0.1%

violation_street_name
Categorical

HIGH CARDINALITY

Distinct count: 1477
Unique (%): 2.4%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
MCNICHOLS | 1125 | 1.8%
SEVEN MILE | 1114 | 1.8%
GRAND RIVER | 1031 | 1.7%
GRATIOT | 894 | 1.5%
WARREN | 869 | 1.4%
LIVERNOIS | 628 | 1.0%
MICHIGAN AVE | 553 | 0.9%
JOY RD | 454 | 0.7%
FENKELL | 437 | 0.7%
ASHTON | 437 | 0.7%
Other values (1467) | 53459 | 87.6%

Length

Max length: 17
Median length: 8
Mean length: 7.842510779
Min length: 3

violation_zip_code
Categorical

HIGH CARDINALITY
MISSING

Distinct count: 58
Unique (%): 0.2%
Missing: 36977
Missing (%): 60.6%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
48228 | 2648 | 4.3%
48227 | 2006 | 3.3%
48224 | 1991 | 3.3%
48234 | 1916 | 3.1%
48219 | 1748 | 2.9%
48235 | 1645 | 2.7%
48205 | 1343 | 2.2%
48221 | 981 | 1.6%
48238 | 954 | 1.6%
48223 | 853 | 1.4%
Other values (48) | 7939 | 13.0%
(Missing) | 36977 | 60.6%

Length

Max length: 5
Median length: 3
Mean length: 3.787593646
Min length: 3

mailing_address_str_number
Categorical

HIGH CARDINALITY
MISSING

Distinct count: 9703
Unique (%): 16.2%
Missing: 1014
Missing (%): 1.7%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
4 | 630 | 1.0%
1 | 391 | 0.6%
484 | 375 | 0.6%
3 | 274 | 0.4%
3233 | 267 | 0.4%
PO BOX | 253 | 0.4%
72 | 226 | 0.4%
P.O. BO | 213 | 0.3%
9 | 184 | 0.3%
18481 | 170 | 0.3%
Other values (9693) | 57004 | 93.4%
(Missing) | 1014 | 1.7%

Length

Max length: 10
Median length: 4
Mean length: 3.746823823
Min length: 1

mailing_address_str_name
Categorical

HIGH CARDINALITY

Distinct count: 16851
Unique (%): 27.6%
Missing: 3
Missing (%): < 0.1%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
GRAND RIVER | 479 | 0.8%
P.O. BOX | 452 | 0.7%
PO BOX | 237 | 0.4%
GRATIOT | 201 | 0.3%
GREENFIELD | 188 | 0.3%
LIVERNOIS | 187 | 0.3%
WOODWARD | 177 | 0.3%
MACK | 169 | 0.3%
W MCNICHOLS | 169 | 0.3%
SCHAEFER | 165 | 0.3%
Other values (16841) | 58574 | 96.0%

Length

Max length: 44
Median length: 9
Mean length: 10.08114621
Min length: 1

city
Categorical

HIGH CARDINALITY

Distinct count: 3266
Unique (%): 5.4%
Missing: 1
Missing (%): < 0.1%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
DETROIT | 26358 | 43.2%
Detroit | 4168 | 6.8%
SOUTHFIELD | 2466 | 4.0%
DEARBORN | 1808 | 3.0%
FARMINGTON HILLS | 773 | 1.3%
WEST BLOOMFIELD | 700 | 1.1%
Southfield | 447 | 0.7%
detroit | 438 | 0.7%
BLOOMFIELD HILLS | 436 | 0.7%
TROY | 434 | 0.7%
Other values (3256) | 22972 | 37.7%

Length

Max length: 44
Median length: 7
Mean length: 8.36604318
Min length: 1

state
Categorical

HIGH CARDINALITY

Distinct count: 58
Unique (%): 0.1%
Missing: 331
Missing (%): 0.5%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
MI | 51866 | 85.0%
CA | 1877 | 3.1%
TX | 913 | 1.5%
FL | 863 | 1.4%
NY | 802 | 1.3%
NV | 387 | 0.6%
SC | 350 | 0.6%
UT | 287 | 0.5%
IL | 275 | 0.5%
OH | 241 | 0.4%
Other values (48) | 2809 | 4.6%
(Missing) | 331 | 0.5%

Length

Max length: 3
Median length: 2
Mean length: 2.005426141
Min length: 2

zip_code
Categorical

HIGH CARDINALITY

Distinct count: 2900
Unique (%): 4.8%
Missing: 3
Missing (%): < 0.1%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
48235 | 2330 | 3.8%
48221 | 2289 | 3.8%
48228 | 2283 | 3.7%
48227 | 2080 | 3.4%
48224 | 2064 | 3.4%
48219 | 2012 | 3.3%
48234 | 1673 | 2.7%
48126 | 1551 | 2.5%
48075 | 1457 | 2.4%
48238 | 1340 | 2.2%
Other values (2890) | 41919 | 68.7%

Length

Max length: 10
Median length: 5
Mean length: 4.978836413
Min length: 1

non_us_str_code
Unsupported

MISSING
REJECTED
UNSUPPORTED

Missing: 61001
Missing (%): 100.0%
Memory size: 476.7 KiB

country
Categorical

CONSTANT
REJECTED

Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
USA | 61001 | 100.0%

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

ticket_issued_date
Date

Distinct count: 33064
Unique (%): 54.2%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB
Minimum: 2012-01-04 14:00:00
Maximum: 2016-12-29 15:00:00
Histogram

hearing_date
Date

MISSING

Distinct count: 3312
Unique (%): 5.6%
Missing: 2197
Missing (%): 3.6%
Memory size: 476.6 KiB
Minimum: 2012-01-19 09:00:00
Maximum: 2017-01-25 13:30:00
Histogram

violation_code
Categorical

HIGH CARDINALITY

Distinct count: 151
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
9-1-104 | 16259 | 26.7%
22-2-88(b) | 15699 | 25.7%
9-1-36(a) | 8653 | 14.2%
22-2-45 | 2844 | 4.7%
9-1-111 | 2246 | 3.7%
9-1-110(a) | 2005 | 3.3%
9-1-81(a) | 1604 | 2.6%
22-2-43 | 1417 | 2.3%
9-1-113 | 1379 | 2.3%
22-2-88(a) | 1309 | 2.1%
Other values (141) | 7586 | 12.4%

Length

Max length: 20
Median length: 9
Mean length: 8.941509156
Min length: 7

violation_description
Categorical

HIGH CARDINALITY

Distinct count: 163
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
Excessive weeds or plant growth one- or two-family dwelling or commercial Building | 16259 | 26.7%
Allowing bulk solid waste to lie or accumulate on or about the premises | 15699 | 25.7%
Failure of owner to obtain certificate of compliance | 8653 | 14.2%
Violation of time limit for approved containers to remain at curbside - early or late | 2844 | 4.7%
Failure of owner to remove graffiti or maintain or restore property free of graffiti. | 2246 | 3.7%
Inoperable motor vehicle(s) one- or two-family dwelling or commercial building | 2005 | 3.3%
Failure to obtain certificate of registration for rental property | 1604 | 2.6%
Improper placement of Courville container between collections | 1417 | 2.3%
Failure to maintain a vacant building or structure in accordance with the requirements of Section 9-1-113 of the Detroit City Code: (1) | 1379 | 2.3%
Failure of owner to keep property, its sidewalks, or adjoining public property free from solid, medical or hazardous waste | 1310 | 2.1%
Other values (153) | 7585 | 12.4%

Length

Max length: 241
Median length: 78
Mean length: 77.69126736
Min length: 20

disposition
Categorical

Distinct count: 8
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
Responsible by Default | 51602 | 84.6%
Responsible by Admission | 4484 | 7.4%
Responsible by Determination | 4124 | 6.8%
Responsible (Fine Waived) by Deter | 781 | 1.3%
Responsible - Compl/Adj by Default | 6 | < 0.1%
Responsible - Compl/Adj by Determi | 2 | < 0.1%
Responsible (Fine Waived) by Admis | 1 | < 0.1%
Responsible by Dismissal | 1 | < 0.1%

Length

Max length: 34
Median length: 22
Mean length: 22.70808675
Min length: 22

fine_amount
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count: 53
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 272.71418501336046
Minimum: 0.0
Maximum: 10000.0
Zeros: 782
Zeros (%): 1.3%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 50
Q1: 50
median: 200
Q3: 250
95-th percentile: 1000
Maximum: 10000
Range: 10000
Interquartile range (IQR): 200

Descriptive statistics

Standard deviation: 360.1018552
Coefficient of variation (CV): 1.320436834
Kurtosis: 58.238565
Mean: 272.714185
Median Absolute Deviation (MAD): 150
Skewness: 4.887642899
Sum: 16635838
Variance: 129673.3461
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
50 | 17322 | 28.4%
250 | 10140 | 16.6%
100 | 9176 | 15.0%
200 | 9093 | 14.9%
500 | 6944 | 11.4%
1000 | 4452 | 7.3%
125 | 932 | 1.5%
0 | 782 | 1.3%
750 | 731 | 1.2%
2500 | 450 | 0.7%
Other values (43) | 979 | 1.6%

Value | Count | Frequency (%)
0 | 782 | 1.3%
20 | 6 | < 0.1%
25 | 166 | 0.3%
30 | 2 | < 0.1%
50 | 17322 | 28.4%

Value | Count | Frequency (%)
10000 | 4 | < 0.1%
5000 | 25 | < 0.1%
4000 | 1 | < 0.1%
3500 | 2 | < 0.1%
3000 | 15 | < 0.1%

admin_fee
Categorical

CONSTANT
REJECTED

Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
20 | 61001 | 100.0%

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

state_fee
Categorical

CONSTANT
REJECTED

Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
10 | 61001 | 100.0%

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

late_fee
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count: 44
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 25.116219406239242
Minimum: 0.0
Maximum: 1000.0
Zeros: 8054
Zeros (%): 13.2%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 5
median: 10
Q3: 25
95-th percentile: 100
Maximum: 1000
Range: 1000
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 36.31015513
Coefficient of variation (CV): 1.445685537
Kurtosis: 56.2729687
Mean: 25.11621941
Median Absolute Deviation (MAD): 10
Skewness: 4.785003551
Sum: 1532114.5
Variance: 1318.427366
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
5 | 15023 | 24.6%
25 | 9077 | 14.9%
0 | 8054 | 13.2%
20 | 7710 | 12.6%
10 | 7528 | 12.3%
50 | 6547 | 10.7%
100 | 4298 | 7.0%
12.5 | 852 | 1.4%
75 | 686 | 1.1%
250 | 447 | 0.7%
Other values (34) | 779 | 1.3%

Value | Count | Frequency (%)
0 | 8054 | 13.2%
2 | 2 | < 0.1%
2.5 | 152 | 0.2%
5 | 15023 | 24.6%
6 | 1 | < 0.1%

Value | Count | Frequency (%)
1000 | 4 | < 0.1%
500 | 23 | < 0.1%
400 | 1 | < 0.1%
350 | 2 | < 0.1%
300 | 15 | < 0.1%

discount_amount
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 14
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.23934033868297241
Minimum: 0.0
Maximum: 250.0
Zeros: 60239
Zeros (%): 98.8%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 0
Maximum: 250
Range: 250
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 3.245894332
Coefficient of variation (CV): 13.56183563
Kurtosis: 1123.700545
Mean: 0.2393403387
Median Absolute Deviation (MAD): 0
Skewness: 26.8565106
Sum: 14600
Variance: 10.53583002
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
0 | 60239 | 98.8%
5 | 228 | 0.4%
10 | 191 | 0.3%
20 | 146 | 0.2%
25 | 94 | 0.2%
50 | 59 | 0.1%
100 | 22 | < 0.1%
13 | 9 | < 0.1%
30 | 4 | < 0.1%
75 | 4 | < 0.1%
Other values (4) | 5 | < 0.1%

Value | Count | Frequency (%)
0 | 60239 | 98.8%
3 | 1 | < 0.1%
5 | 228 | 0.4%
10 | 191 | 0.3%
13 | 9 | < 0.1%

Value | Count | Frequency (%)
250 | 1 | < 0.1%
150 | 2 | < 0.1%
100 | 22 | < 0.1%
75 | 4 | < 0.1%
50 | 59 | 0.1%

clean_up_cost
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count: 298
Unique (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.649710660480977
Minimum: 0.0
Maximum: 15309.0
Zeros: 59421
Zeros (%): 97.4%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 0
Maximum: 15309
Range: 15309
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 242.3751802
Coefficient of variation (CV): 11.73746132
Kurtosis: 1015.272792
Mean: 20.64971066
Median Absolute Deviation (MAD): 0
Skewness: 26.06415029
Sum: 1259653
Variance: 58745.72798
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
0 | 59421 | 97.4%
80 | 111 | 0.2%
400 | 99 | 0.2%
40 | 97 | 0.2%
120 | 86 | 0.1%
200 | 71 | 0.1%
160 | 59 | 0.1%
320 | 49 | 0.1%
240 | 35 | 0.1%
280 | 35 | 0.1%
Other values (288) | 938 | 1.5%

Value | Count | Frequency (%)
0 | 59421 | 97.4%
1 | 3 | < 0.1%
10 | 1 | < 0.1%
13 | 1 | < 0.1%
20 | 17 | < 0.1%

Value | Count | Frequency (%)
15309 | 1 | < 0.1%
13212 | 1 | < 0.1%
13124 | 1 | < 0.1%
12894 | 1 | < 0.1%
9214 | 2 | < 0.1%

judgment_amount
Real number (ℝ≥0)

ZEROS

Distinct count: 503
Unique (%): 0.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 347.89554105670396
Minimum: 0.0
Maximum: 15558.8
Zeros: 790
Zeros (%): 1.3%
Memory size: 476.6 KiB

Quantile statistics

Minimum: 0
5-th percentile: 80
Q1: 85
median: 250
Q3: 305
95-th percentile: 1130
Maximum: 15558.8
Range: 15558.8
Interquartile range (IQR): 220

Descriptive statistics

Standard deviation: 460.0580427
Coefficient of variation (CV): 1.322402815
Kurtosis: 103.5978064
Mean: 347.8955411
Median Absolute Deviation (MAD): 165
Skewness: 6.606638626
Sum: 21221975.9
Variance: 211653.4027
Histogram with fixed size bins (bins=10)
Value | Count | Frequency (%)
85 | 15017 | 24.6%
305 | 9073 | 14.9%
250 | 7392 | 12.1%
140 | 6810 | 11.2%
580 | 6383 | 10.5%
1130 | 4145 | 6.8%
80 | 2291 | 3.8%
130 | 1577 | 2.6%
230 | 1333 | 2.2%
280 | 1058 | 1.7%
Other values (493) | 5922 | 9.7%

Value | Count | Frequency (%)
0 | 790 | 1.3%
50 | 3 | < 0.1%
52 | 2 | < 0.1%
55 | 14 | < 0.1%
57.5 | 152 | 0.2%

Value | Count | Frequency (%)
15558.8 | 1 | < 0.1%
13342 | 1 | < 0.1%
13263.8 | 1 | < 0.1%
13033.8 | 1 | < 0.1%
11030 | 4 | < 0.1%

grafitti_status
Categorical

MISSING

Distinct count: 1
Unique (%): < 0.1%
Missing: 58780
Missing (%): 96.4%
Memory size: 476.6 KiB

Value | Count | Frequency (%)
GRAFFITI TICKET | 2221 | 3.6%
(Missing) | 58780 | 96.4%

Length

Max length: 15
Median length: 3
Mean length: 3.43691087
Min length: 3

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
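
The calculation just described can be sketched in plain Python as follows. This is an illustration of the formula, not the code pandas-profiling runs; the `fine` and `late` lists are hypothetical values chosen so that one is an exact linear function of the other.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors of covariance and variances cancel, so sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: late is exactly 10% of fine, so r should be 1.
fine = [50, 100, 200, 250, 500]
late = [5, 10, 20, 25, 50]
r = pearson_r(fine, late)

# Shifting and rescaling one variable leaves r unchanged.
r_shifted = pearson_r(fine, [3 * v + 7 for v in late])
print(r, r_shifted)
```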

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
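
As a sketch of that recipe, the code below ranks both variables and then applies the covariance-over-standard-deviations formula to the ranks. Assigning tied values the average of their positions is an assumption made here; the report does not say how ties are handled. The data are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical monotonic but nonlinear relationship.
x = [1, 2, 3, 4, 5]
y = [v ** 2 for v in x]
print(spearman_rho(x, y), pearson_r(x, y))
```

On this example ρ reaches 1 while r stays below 1, which illustrates why ρ is described as better suited to nonlinear monotonic relationships.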

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
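
The pair-counting definition above corresponds to what is usually called tau-a, and can be sketched directly. Note that statistical libraries commonly compute the tie-adjusted tau-b variant instead, so results may differ from this sketch when the data contain ties. The data below are hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # Pairs tied in x or y count as neither concordant nor discordant.
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs


# Hypothetical data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)
print(tau)
```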

Phik (φk)

Phik (φk) is a correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
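
A minimal sketch of a bias-corrected Cramér's V follows, using the correction commonly attributed to Bergsma (2013): the chi-squared-based φ² and the table dimensions are each shrunk by a term in 1/(n − 1). Whether pandas-profiling implements exactly these steps is an assumption; the contingency tables are hypothetical.

```python
import math


def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as rows of counts."""
    n = sum(sum(row) for row in table)
    r, k = len(table), len(table[0])
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(r)) for j in range(k)]

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_tot[i] * col_tot[j] / n
            chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    return math.sqrt(phi2_corr / min(k_corr - 1, r_corr - 1))


independent = [[10, 20], [20, 40]]  # rows are proportional
perfect = [[50, 0], [0, 50]]        # each row maps to one column
print(cramers_v_corrected(independent), cramers_v_corrected(perfect))
```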

Missing values

Sample

First rows

ticket_idagency_nameinspector_nameviolator_nameviolation_street_numberviolation_street_nameviolation_zip_codemailing_address_str_numbermailing_address_str_namecitystatezip_codenon_us_str_codecountryticket_issued_datehearing_dateviolation_codeviolation_descriptiondispositionfine_amountadmin_feestate_feelate_feediscount_amountclean_up_costjudgment_amountgrafitti_status
0284932Department of Public WorksGranberry, Aisha BFLUELLEN, JOHN A10041.0ROSEBERRYNaN141ROSEBERRYDETROITMI48213NaNUSA2012-01-04 14:00:002012-01-19 09:00:0022-2-61Failure to secure City or Private solid waste collection containers and servicesResponsible by Default200.020.010.020.00.00.0250.0NaN
1285362Department of Public WorksLusk, GertrinaWHIGHAM, THELMA18520.0EVERGREENNaN19136GLASTONBURYDETROITMI48219NaNUSA2012-01-05 09:50:002012-02-06 09:00:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Default1000.020.010.0100.00.00.01130.0NaN
2285361Department of Public WorksLusk, GertrinaWHIGHAM, THELMA18520.0EVERGREENNaN19136GLASTONBURYDETROITMI48219NaNUSA2012-01-05 09:50:002012-02-06 09:00:0022-2-43Improper placement of Courville container between collectionsResponsible by Default100.020.010.010.00.00.0140.0NaN
3285338Department of Public WorksTalbert, ReginaldHARABEDIEN, POPKIN1835.0CENTRALNaN2246NELSONWOODHAVENMI48183NaNUSA2012-01-05 10:25:002012-02-07 09:00:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Default200.020.010.020.00.00.0250.0NaN
4285346Department of Public WorksTalbert, ReginaldCORBELL, STANLEY1700.0CENTRALNaN3435MUNGERLIVONIAMI48154NaNUSA2012-01-05 10:20:002012-02-14 09:00:0022-2-45Violation of time limit for approved containers to remain at curbside - early or lateResponsible by Default100.020.010.010.00.00.0140.0NaN
5285345Department of Public WorksTalbert, ReginaldCORBELL, STANLEY1700.0CENTRALNaN3435MUNGERLIVONIAMI48154NaNUSA2012-01-05 10:20:002012-02-14 09:00:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Default200.020.010.020.00.00.0250.0NaN
6285347Department of Public WorksTalbert, ReginaldCORBELL, STANLEY1700.0CENTRALNaN3435MUNGERLIVONIAMI48154NaNUSA2012-01-05 10:20:002012-02-07 10:30:009-1-110(a)Inoperable motor vehicle(s) one- or two-family dwelling or commercial buildingResponsible by Default50.020.010.05.00.00.085.0NaN
7285342Department of Public WorksTalbert, ReginaldNICKOLA CORPORATION, W & H1605.0LIVERNOISNaN1382WHITEHOUSE CTROCHESTER HILLSMI48306NaNUSA2012-01-05 09:50:002012-02-07 09:00:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Determination200.020.010.00.00.00.0230.0NaN
8285530Department of Public WorksBuchanan, DarylINTERSTATE INVESTMENT GROUP LL, .3408.0BEATRICENaN341HAMPTONGILBERTSC29054NaNUSA2012-01-05 11:30:002012-02-08 13:30:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Default1000.020.010.0100.00.00.01130.0NaN
9284989Department of Public WorksBuchanan, DarylYAMAN, BATURAY8040.0SARENANaN43494ELLSWORTH # 20FREMONTCA94539NaNUSA2012-01-05 13:10:002012-01-25 13:30:0022-2-88(b)Allowing bulk solid waste to lie or accumulate on or about the premisesResponsible by Default500.020.010.050.00.00.0580.0NaN

Last rows

ticket_idagency_nameinspector_nameviolator_nameviolation_street_numberviolation_street_nameviolation_zip_codemailing_address_str_numbermailing_address_str_namecitystatezip_codenon_us_str_codecountryticket_issued_datehearing_dateviolation_codeviolation_descriptiondispositionfine_amountadmin_feestate_feelate_feediscount_amountclean_up_costjudgment_amountgrafitti_status
60991376482Buildings, Safety Engineering & Env DepartmentPierson, KevinNPML Mortgage Acquistion LLC c/o Home Servicing18827.0KLINGERNaN533Highlandia DriveBaton RougeLA70810NaNUSA2016-12-28 13:00:002017-01-23 10:30:009-1-83 - (Dwelling)Failure to obtain a lead clearance for rental property - one or two-family dwellingResponsible by Default500.020.010.050.00.00.0580.0NaN
60992376480Buildings, Safety Engineering & Env DepartmentPierson, KevinNPML Mortgage Acquistion LLC c/o Home Servicing18827.0KLINGERNaN533Highlandia DriveBaton RougeLA70810NaNUSA2016-12-28 12:30:002017-01-23 10:30:009-1-43(a) - (DwellinFail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 fResponsible by Default500.020.010.050.00.00.0580.0NaN
60993376479Buildings, Safety Engineering & Env DepartmentPierson, KevinNPML Mortgage Acquistion LLC c/o Home Servicing18827.0KLINGERNaN533Highlandia DriveBaton RougeLA70810NaNUSA2016-12-28 12:15:002017-01-23 10:30:009-1-43(a) - (DwellinFail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 fResponsible by Default500.020.010.050.00.00.0580.0NaN
60994376481Buildings, Safety Engineering & Env DepartmentPierson, KevinNPML Mortgage Acquistion LLC c/o Home Servicing18827.0KLINGERNaN533Highlandia DriveBaton RougeLA70810NaNUSA2016-12-28 12:45:002017-01-23 10:30:009-1-43(a) - (DwellinFail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (1 or 2 fResponsible by Default500.020.010.050.00.00.0580.0NaN
60995376483Buildings, Safety Engineering & Env DepartmentPierson, KevinNPML Mortgage Acquistion LLC c/o Home Servicing18827.0KLINGERNaN533Highlandia DriveBaton RougeLA70810NaNUSA2016-12-28 13:15:002017-01-23 10:30:009-1-81(a)Failure to obtain certificate of registration for rental propertyResponsible by Default250.020.010.025.00.00.0305.0NaN
60996376496Buildings, Safety Engineering & Env DepartmentPierson, KevinTHE AIC GROUP12032.0SANTA ROSA48204P.O. BO969SouthfieldMI48037NaNUSA2016-12-29 09:30:002017-01-23 10:30:009-1-43(a) - (StructuFail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories)Responsible by Default1000.020.010.0100.00.00.01130.0NaN
60997376497Buildings, Safety Engineering & Env DepartmentPierson, KevinTHE AIC GROUP12032.0SANTA ROSA48204P.O. BO969SouthfieldMI48037NaNUSA2016-12-29 09:50:002017-01-23 10:30:009-1-43(a) - (StructuFail to comply with an Emergency or imminent danger order concerining an unsafe or unsanitary structure or unlawful occupancy (all other structures, except buildings with five (5) or more stories)Responsible by Default1000.020.010.0100.00.00.01130.0NaN
60998376499Detroit Police DepartmentBOWLES, TIFFANIBARLOW, CHRISTOPHER D11832.0KILBOURNE4821311832KILBOURNEDETROITMI48213NaNUSA2016-12-29 14:30:002017-01-20 09:00:0022-2-45Violation of time limit for approved containers to remain at curbside - early or lateResponsible by Default100.020.010.010.00.00.0140.0NaN
60999376500Detroit Police DepartmentBOWLES, TIFFANIWILLIAMS, JASON11848.0KILBOURNE482134317YORKSHIREDETROITMI48224NaNUSA2016-12-29 15:00:002017-01-20 09:00:0022-2-45Violation of time limit for approved containers to remain at curbside - early or lateResponsible by Default100.020.010.010.00.00.0140.0NaN
61000369851Department of Public WorksJohnson, ValentinaLEONARD , KENNETH AND JEAN6100.0IRONWOOD4821071TYLERDETROITMI48203NaNUSA2016-08-31 11:05:002016-10-04 13:30:009-1-104Excessive weeds or plant growth one- or two-family dwelling or commercial BuildingResponsible by Default50.020.010.00.00.00.080.0NaN